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Abstract 

Measuring culture and its dynamics through surveys has important limitations, but the 
emerging field of computational social science allows us to overcome them by analyzing 
large-scale datasets. In this article, we study cultural dynamics through the votes in the 
Eurovision song contest, which are decided by a crowd-based scheme in which viewers vote 
through mobile phone messages. Taking into account asymmetries and imperfect perception 
of culture, we measure cultural relations among European countries in terms of cultural 
affinity. We propose the Friend-or-Foe coefficient, a metric to measure voting biases among 
participants of a Eurovision contest. To validate how this metric represent cultural affinity 
we designed a model of a random, biased Eurovision contest. Simulations of this model show 
how our metrics can detect negative affinities and serve as an estimator for positive affinities. 
We apply this estimator to the historical set of Eurovision contests from 1975 to 2012, finding 
patterns of asymmetry and clustering in the resulting networks. Furthermore, we define a 
measure of vote polarization that, when applied to empirical data, shows a sharp increase 
within countries of the EU during 2010 and 2011. As a result, we measure how the recent 
political decisions of EU states influence the way their citizens relate to the culture of other 
EU members, leading to stronger cultural biases in the way they vote in the Eurovision song 
contest. 

Keywords: Cultural dynamics; international networks; social simulation 

1 Introduction 

How do cultures evolve? How do they influence each other? These questions are not only central 
to human sciences, like anthropology or ethnology, but play a major role in politics, economics, 
and international relations. Among the scientific tools to study human cultures, agent-based 
modeling provides quantitative insights to culture and opinion dynamics [5]. The original works 
of Axelrod (2j [3] motivate how computational modeling can be used to understand the evolution 
of human cultures. These agent-based models, while being essential simplifications of a more 
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complicated phenomenon, allow us to draw the conditions for the emergence of macroscopic 
social behavior |H [5], such as polarization [14] or clustering [30J, as well as to predict future 
social phenomena |36^ I39| . Furthermore, modeling and simulation can open new questions that 
drive future research, allowing -in an ideal case- a deeper understanding of human societies 
through multidisciplinary research |15j . 

As well as sociological theories need to be empirically testable, computational models of social 
behavior need to be formulated over assumptions that can be verified against empirical data. 
When such behavior is objectively measurable, e.g. economic decisions [6] or voting [8] [26], 
datasets can be produced in a way such that we can directly measure the state of a human. On the 
other hand, when a model includes subjective elements, such as emotions or beliefs, measuring 
the internal states and dynamics of a human becomes a cumbersome task. As an example, 
Axelrod's model introduces the internal state of an agent as a vector of cultural dimensions, 
or opinions, which change according to certain rules |2]. To validate this kind of dynamics, we 
need to be able to measure the subjective internal state of a human, and how it changes when 
interacting with others. While survey data can shed light on opinions and culture [23], there is a 
subconscious component of cultural behavior that cannot be encoded in words |42]. Nevertheless, 
this component can be indirectly measured, for example through physiological responses |18U25] . 
or through online traces such as expression biases |17j . and behavioral patterns in computer 
mediated interaction [16J. 

Quantitative models need to be validated on the dynamics of individual states, but are often 
aimed to reproduce macroscopically observable collective behavior. When addressing cultures or 
societies as a whole, issues of data availability become critical. It can be expensive to query large 
amounts of individuals, limiting the application of subjective reports and surveys. In addition, 
approaching these questions through experimental studies suffers additional problems. For ex- 
ample, experiments cannot reproduce natural exposure in the context of culture and popularity 
[31] . limiting the representativeness of any experimental study. The emerging field of compu- 
tational social science |20| 128] aims at overcoming these limitations, studying human behavior 
through the statistical analysis of large-scale datasets. Such datasets, when available, offer the 
opportunity to validate the macroscopic behavior explained by computational models of social 
interaction. Following the example of Axelrod's model, its validation requires to measure how 
whole cultures change in time, as well as the distances between different cultures. 

In this article, we aim at providing a way to measure the relations between cultures through 
their voting patterns in a set of song contests, in particular looking for biases in the way they 
evaluate each other. This way, we are measuring the dynamics of culture i) at a large-scale level 
usually unreachable for independent research, and ii) measuring subjective biases that are not 
explicitly expressed by the studied individuals. It is of special relevance to measure these kind 
of relations in a timely manner, in order to address possible changes in the relation between 
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pairs of countries. The political decisions of a country, the results of sport events, or the current 
state of the economy might impact the evolution of the public opinion of one society towards 
another. For the case of Europe, the policies of the European Union regarding the debt crisis 
might have an impact on the "state of the union", or how countries within the EU perceive each 
other. Studying data with a time component, we measure how these events play a role in the 
manifestation of cultural relations, with the aim of providing a macroscope tha measures the 
state of the union of Europe at large. 

2 The Eurovision Song Contest 

In this article, we present our study of the relations between European countries through the 
set of results of the Eurovision Song Contest, an annual competition held among the country 
members of the European Broadcasting Union. Every year, each participating country chooses a 
representative artist to compete by performing a song, which is included in a live event broad- 
casted simultaneously in the whole Europe. After the performance of each participant, voting 
countries gather televotes and jury votes [33], creating a local ranking of songs from other contes- 
tants. Afterwards, each voting country publicly announces which other countries receive points 
from 1 to 8, 10, and 12, according to their local rankings. The winner of the contest is the 
country with the song that accumulated the highest amount of points. Extensive and detailed 
descriptions about the contest, its rules, and its history can be found elsewhere |19| I40j . 

While the contest rules and participating countries have changed over the years, this contest 
offers a timely source of cultural evaluations across most European countries. Eurovision has been 
subject of substantial research, up to the point of the usage of the term "eurovisiopsephology" 
|19] . defined as the study of the results of the votes casted in the Eurovision song contest. 
Initial research focused on the possible existence of voting clusters or alliances [40, 41], generally 
due to geographical locations, diaspora effects, language, and religious similarities [38J. Further 
studies combined network analysis with simulations of maximally random contests, revealing how 
Eurovision results have high clustering |12j , which results in voting blocks |19| [34"] , and higher 
chances to win for countries depending on their position in the voting network [9] [37] . 

Since 2004, all the countries participating in the contest choose their votes according to televoting, 
a method that uses phone calls and mobile phone messages of viewers to decide how a country 
votes. Since 2009, these televotes were combined with some expert judges, turning Eurovision 
in an experimental ground to compare popular and expert choices. Recent studies show the 
statistical changes due to televoting [34], while older works measure how expert judges chose 
their votes according to song quality rather than cultural biases |22] , Either way, the results of 
this contest highlight the stable cultural relations between countries [38] , where voting trades or 
game theoretical decisions do not seem to play a role [21] , 
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2.1 Controversies and applications 

Recently, Google set up a Eurovision predictor based on search queries, leading to the correct 
prediction in 2009 and 201CQ . This was discontinued in 2011, after a contested prediction result 
between Lena, the previous German winner who was competing again, and the Irish partici- 
pants called Jedwarcj^] The outcome of this prediction failed, as both countries were defeated 
by Azerbaijan by more than 100 points. In addition, seems that users were exploiting the search 
engine to try to push their country higher in the prediction^] as if searching for your representa- 
tive would increase its chance to win. This reaction to a prediction mechanism shows how social 
systems, as complex adaptive systems, can change their behavior due research results, leading 
to the invalidation of prediction tools or even to self-fulfilling predictions. 

Our approach does not aim to predict contest outcomes or to reveal voting alliances, but to use 
Eurovision as a social macroscope for the relations across European countries. Initial results show 
how Eurovision outcomes can predict international trade [27] , which motivates the measuring 
of the cohesion of European countries and the EU through Eurovision [34J. Popular culture 
and mass media criticize the contest organization, claiming that some countries are treated as 
European only in Eurovision^] as a limitation for a "Europeaniziation process" |24). In addition, 
the contest rules and results are periodically claimed to be unfair, biased^ , or even farcicaj^J 
portraying the contest as a European popularity survey rather than an artistic competition. 
In this article, we precisely aim to measure these biases as relevant quantities, focusing on the 
political, social, and cultural component of the contest rather than on its artistic one. 

2.2 Exploring Eurovision data 

We gathered the whole historical set of Eurovision results from WikipediaQ which contains a 
page for each edition of the contest, and from the official website of the contest]^] 

For each year, we count with a matrix with the values p v ^ c , where each entry corresponds to 
the amount of points given by a country c v to the competing song of another country c c . As 
explained before, p„ )C is contained in the set {0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 12}, and chosen according 



1 calmyourbeans . wordpress . com/2012/05/22/no- google -euro vision- predict or- this -year/ 
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"I'm sick of being European just on Eurosong" https : //www.youtube . com/watch?v=IK8f VHNkOoM 
http : //news .bbc . co .uk/2/hi/uk_news/wales/ south_east/3719157Tstm| 



http : //news .bbc . co .uk/2/hi/ entertainment/6654719 . stm 
For an example of a contest result page, see: http://en.wikipedia.org/wiki/Eurovision_Song_Contest. 
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to the ranking of televotes and jury votes. Our dataset comprises the whole set of results of 
Eurovision editions from 1957 to 2012, including 11775 voting relations between 51 country 
members of the European Broadcasting Union. 

The straightforward approach to understand this data is to look into the voting network formed 
every year |12j . where nodes are participating countries. A directed edge c„ — > c c connects two 
nodes if c v assigned that year a nonzero amount of points to c c . Edge weights are assigned to be 
the amount of points given by the vote, p VjC . The left panel of Figure [T] shows this network for 
the edition of 2008, with edge darkness according to weight, and node darkness proportional to 
the final score s c = ^2 Cv Pv,c of each country c c in the contest. The topological properties of these 
networks have been widely explored, finding symmetrical relations, triadic clustering, and highly 
connected blocks that map to geographically close, and culturally related countries |19 | [37 ] HO]. 




Figure 1: Left: network of votes for the 2008 edition of Eurovision. Nodes are participating 
countries, with color darkness proportional to the final score of the country. Directed edges 
represent the votes given by one country to another, with darker color according to the amount 
of points given by the vote. Right: simulation results with the same size as the 2008 contest, 
visualized in the same manner as the left side. 

Visual inspection of this network, as shown in the left panel of Figure [TJ reveals a significant 
heterogeneity in node darkness. This corresponds to the large deviation of final scores usually 
present in this contest. Initial editions of the contest had multiple draws, so the voting scheme 
was changed to the current one in 1975, in order to encourage the selection of a single winner. The 
resulting heterogeneity is relevant to test the existence of winner-takes-all effects as in cultural 
markets [35], and product reviews [29]. To do so, we calculated the relative score s' c = ^, where 
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T = Y2c c s c i s the total amount of points given in an edition of the contest, which depends on the 
amount of countries participating in a given year. This way we can aggregate all participant scores 
since 1975, as shown in the histogram of Figure [2] Similarly to the cultural markets mentioned 
above, the distribution of s' shows a large variance, and positive skewness. On the other hand, 
the log-log histogram shown in the inset of Figure [2] allows us to notice that there are no scaling 
relations, probably due to the finite size of the contest. We can say that the contest has a large 
variance of final scores, yet these do not allow arbitrarily large values, as opposed to previous 
experience in popularity analysis. We will use these final scores to compare individual votes with 
final results, as explained below. 




i 1 1 1 1 1 1 1 

0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 

s 

Figure 2: Relative score s' distribution for contest results from 1975 to 2012. Inset: log-log version 
of the distribution. We use this empirical distribution as input for our simulations. 

3 Measuring cultural relations through Eurovision 
3.1 Perception of culture 

Most agent-based models of culture dynamics include agent interactions based on their internal 
states, usually depending on the distance between the values of their cultural features. While 
valid to reproduce the emergence of opinion groups and cultures [32j, there is still ample room 
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to validate and empirically test the existence of this kind of dynamics. The existence of cultural 
dimensions was first introduced by Hofstede |23j . in a study of surveys across different countries. 
These dimensions were detected by means of dimensionality reduction on the survey responses, 
and have been applied in numerous studies about culture |13] , including a study on Eurovision 
[2 Ij . On the other hand, measuring culture through surveys has clear limitations [I], in particular 
in the interpretation of the meaning of the results of dimensionality reduction. 

Apart from dimensional structures, a key component in models of culture dynamics is the set of 
rules that determine which agents interact and how. While these rules can represent spontaneous 
events of influence between cultures, in other scenarios work as a mechanism in which agents 
perceive the state of others. In a realistic setup, the perception of cultural differences might 
be constrained by imperfect communication, and path dependencies like historical events or 
stereotypes. Such phenomena can shape the way culture is perceived across a society, leading to 
new structures to take into account in future models. For example, with the minimal assumption 
that humans can only perceive a set of dimensions from another culture, the perceived distance 
between cultures could have asymmetric properties. In the schema of Figure [3j we sketch two 
cultures with binary feature vectors of five dimensions. If the left one can only perceive the three 
first features of the other, its perceived Hamming distance would be 1, as they just differ in 
the third feature. At the same time, if the right one can only perceive the last three features 
of the left one, the perceived distance would be 3, leading to asymmetric perception of cultural 
differences. 

Another possibility is that each society might have a reference point, i.e. an expected or "accept- 
able" maximum cultural distance towards another. This would lead to the existence of negative 
cultural relations, which would be a plausible explanation for multiple international conflicts 
present in History. This possibility is commonly ignored when taking into account cultural dis- 
tances in discrete spaces, and might very well be a property of realistic cultural dynamics. 

Given possible asymmetries and signed values, we will define cultural affinity of a society towards 
another as "the differences perceived by the members of a society in relation to the culture of an- 
other society, evaluated as a comparison with a reference point". Cultural affinity takes maximum 
values for very close cultures, and negative values towards very different cultures. By analyzing 
Eurovison results, we want to explore the two properties that differentiate cultural distance from 
cultural affinity, i.e. i) the presence of asymmetry in the perceived differences between cultures, 
and ii) the existence of negative and positive cultural evaluations among European cultures. 

3.2 The Friend-or-Foe coefficient 

To measure cultural affinity, we need to define a way to estimate it from the raw Eurovision 
scores of our dataset. For this, we define the Friend-or-Foe (FoF) coefficient of country c v towards 
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Figure 3: Schema with a possible scenario of feature-based cultures under imperfect perception. 
Each culture is composed of a set of agents with the same values of the cultural vector, shown 
over them. Their asymmetric relation is a result of partial perception of the cultural features of 
the other. 



country c c , as estimated from a particular edition of the contest: 



Fof(c v ,c c 



Pv,c 



Pv 



12(N 



1 



where p v>c are the points assigned to c c by c v , s c is the final score of c c , and N is the total amount 
of countries voting in the studied edition of Eurovision. The first term of the right hand side 
of Equation [T] represents a normalized value of the score given by c v to c c , ranging from for 
no points given, to 1 when 12 points were assigned to c c . The second term corrects for the final 
score of c c in the whole contest, calculating the total amount of points given by other countries 
different than c v . The maximum value of this score is 12(iV — 2), as one country cannot vote 
itself and we have already subtracted c v from the calculation. 

We designed the Friend-or-Foe coefficient to measure the overvoting or undervoting bias from 
a country to another, correcting for "song quality" as estimated by the final contest result |21| . 
This way, we aim at removing the effects of the artistic component of the contest, highlighting 
the political or cultural biases that are commonly claimed to exist in Eurovision. If a country 
c v assigns 12 points to c c , while all the others assign 0, then FoF(c v ,c c ) = 1, which would be 
the maximum value of an overvoting bias. If c v assigns points to c c but all the other countries 
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assign 12, then FoF(c v ,c c ) = — 1, representing the maximally negative Friend-or-Foe coefficient 
given the contest rules. 

After this definition, we need to assess if the FoF is a valid measure to estimate the real cultural 
affinity of one coutry towards another, as described above. In the following, we perform a case 
study of the values of the FoF for different years between pairs of countries with known cultural 
similarities, as well as countries with explicit conflicts. We continue by defining and simulating a 
model for Eurovision contests, including heterogeneous song quality, and an underlying network 
of cultural affinities. By numerical analysis of such model we want to measure the quality of this 
approximation, as the contest rules might distort and limit the quality of the FoF as an estimator 
for cultural affinity. 

3.3 Dyadic relations over time 

For any pair of countries c\ and C2 we can calculate the FoF coefficients between them in each 
contest in which they participated together. The values of FoF(a,C2) and FoF(c2,ci) might 
depend on effects that influence Eurovision votes, such as language and geographical proximity. In 
Figure [4] we present the FoFs for some years between pairs of countries with known similarities and 
conflicts. In the following, we make a case study on how we can use the FoF as an approximation 
for measuring the relations between European cultures, unifying the already known biases in 
previous literature. 

• Cultural proximity. The standard example for the expression of cultural similarity in 
Eurovision is Cyprus and Greece \12\ [19| I34j . Figure shows the FoF between these 
two countries from 2002 until 2012. Both values are positive in each edition of the contest, 
never dropping below 0.3. This way, the FoF is consistent with previous research [38] . where 
cultural proximity and language are major components in the voting trends of Eurovision. 
Note that, while we have a value of FoF {Cyprus, Greece) for every year, some values of 
FoF [Greece, Cyprus) are missing. This is due to the fact that, since 2002, Greece has 
always competed in the final, but Cyprus did not qualify for the final round every year. 

• Asymmetric effects. One of the relations we want to explore is the possible asymmety 
of cultural affinities between the inhabitants of two countries. The FoF pair of Turkey and 
Armenia, shown in Figure [4p is a clear example of an asymmetric relation between coun- 
tries. FoF(Turkey, Armenia) keeps a significantly positive value, representing Armenian 
diaspora living in Turkey. This same effect of 'patriotic voting' was suggested for Turkish mi- 
grants across Europe [38J, and our FoF coefficient reflects it in this case. On the other hand, 
FoF(Armenia, Turkey) is significantly low and mostly below 0. This negative relation is a 
possible expression of negative relations due to historical conflicts between both countries. 
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Another example of asymmetric relations is the Greece and Germany pair, shown in Figure 
|4p. While FoF(Germany, Greece) is significantly positive, the FoF(Greece, Germany) 
is significantly negative. A plausible explanation might be the economic and political rela- 
tions between both countries in the last years, but this asymmetry seems to stretch to past 
times before the creation of the Euro. As an alternative, another possible explanation is 
the existence of perceived cultural asymmetry, in a way such that German inhabitants feel 
a closer to Greek culture, or a diaspora effect caused by Greek immigrants in Germany. 

• Negative relations. The second type of cultural relation we want to explore is the pos- 
sibility of negative relations between pairs of countries. Couples of countries with explicit 
territorial conflicts show this way symmetric negative FoFs, as shown in Figure |4j3 and|4^. 
Turkey and Cyprus have diplomatic conflicts regarding the status of Northern Cyprua^J and 
Armenia and Azerbaijan are still officially in war since the Nagorno-Karabakh conflict)^] 
This negativity is evident in their FoF coefficients, as these countries consistently avoid 
voting each other. But political conflicts might not be the only reason for negative FoFs, 
as illustrated in the example of Greece and Norway on Figure [4p. We do not have any 
plausible assumption for this negative pair rather than the mere cultural distance between 
one of the northmost and one of the southmost European countries. This way, the FoF 
coefficient would be a valid estimator for possible negative cultural relations, in the case 
that two countries consider each other too far in terms of cultural distance. 



3.4 A model for biased Eurovision contests 

The above case study shows that the Friend-or-Foe coefficient consistently represents previous 
findings and types of symmetric, asymmetric, as well as positive and negative cultural relations. 
Yet this qualitative study does not allow us to determine the actual relation between the FoF and 
any underlying, quantifiable, cultural affinity. In the following, we propose a model to simulate 
Eurovision contests under heterogeneous song quality and the influence of cultural affinity. Our 
aim is to study the relation between the FoF as manifested through contest simulations, given a 
network of affinities between countries. 

Our model receives as an input a network in which nodes represent countries, connected by edges 
with weights that represent cultural affinities, i.e. a measure that takes high positive values for 
culturally similar countries, and negative values for highly dissimilar cultures. Thus, an edge e vc 
connecting the node of country c v to the node of country c c has a weight w vc that measures the 
cultural affinity of c v towards c c . This network is composed of two subnetworks, with participant 
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Figure 4: Time evolution of Friend or Foe coefficients between selected pairs of countries. The 
example of Greece and Cyprus show a symmetric positive relation, possibly due to cultural 
similarity. On the other hand, the example of Greece and Norway shows a symmetric negative 
relation, possibly due to cultural differences. The case of Greece and Germany shows an asym- 
metric relation, where Germany shows significantly larger FoF towards Greece than vice versa. 
The right column shos possible political conditions of the FoF, where country pairs with re- 
gional conflicts have symmetric negative FoF (Cyprus- Turkey, Armenia- Azerbaijan), or diaspora 
effects create asymmetries like the Turkey-Armenia pair. The horizontal dashed lines represent 
the baselines calculated in Section I3.5I 

and only voting countries. The subnetwork between participants is fully connected, directed, 
weighted network, and the only voting countries are connected to all the participant ones by 
unidirectional weighted links. 

At the beginning of a simulation, we assign a quality value q to each participant, sampled 
uniformly at random from the distribution of rescaled scores s' c shown in Figure [2] This value is 
an approximation of the artistic quality of a song |21] . which is supposed to determine the final 
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outcome 7of the contest. A simulation of the model is composed of two steps: 

1. Each country a constructs the ranking of the other participant countries, by computing a 
value tj that is a function of the weight w%j of the edge ejj, and the quality of the song, 
qj. Our initial assumption is that the form of this function is 



This way, f(w,q) is a linear combination of w and q, with a proportionality factor a. 
This function represents the combination of jury votes and televotes, as empirical studies 
show that the jury is more influenced by the artistic quality of a song than the televotes 
|22| . which seem to be driven by geographical and cultural biases. The current rules of the 
contest give the same weight to both votes, so we will choose a = 0.5 for our simulations. 

2. Given the rankings of each node, the agents cast their votes in order, assigning them 
according to the voting scheme of Eurovision, awarding a set of points from 1 to 8, then 
10, and finally 12 to the neighbor with the top value in the ranking. 

As an initial assumption, we take edge weights Wij sampled from a uniform distribution with 
Wmin = — 1 and w max = 1. This way, simulations of our model can equally span positive and 
negative values of cultural relations, with possible asymmetries. 

The output of the model is an artificial voting result that can be compared with the real world 
data. The right panel of Figure [T] shows the outcome of a simulation, displayed with the same 
method as the empirical data of the left panel. Both networks show similar properties of degree 
distribution and density, but the purpose of our simulations was not to reproduce topological 
properties of the contest results. In the following section, we present an analysis of the relation 
between simulated FoFs and edge weights, exploring how to estimate cultural affinities through 
Eurovision results. 

3.5 Numerical analysis of the Friend-or-Foe coefficient 

The countries participating in Eurovision cannot distribute the points they give in an arbitrary 
way, they should follow rules that constrain these values as explained above. Under this restric- 
tion, it follows to ask about the possible distortion that these rules create on the estimation of 
cultural affinities, as calculated by the Friend-or-Foe coefficient. After a simulation of our model 
for biased Eurovision contests, we can analyze the relation between the weight of an edge Wij, 
and the FoF calculated on the simulation result. Figure [5] shows a scatter and barplot of the FoF 
in a simulation of our model as a function of edge weight. There is a linear relation for weights 




(2) 
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above zero, while the FoF seems to keep a constant value for negative weights. To quantify this, 
we calculated a linear regression of the form 

FoF(eij) = hwij + b (3) 

for values of Wij above 0. The dashed diagonal line of Figure [5] shows the regression results, where 
bi = 0.93593, and b = -0.39460, with R 2 = 0.75 and p-values below 10" 15 . The horizontal 
dashed line shows the mean FoF of negative weights, —0.31 



CO 

o 




Figure 5: Scatter plot of cultural affinities and FoF for a simulation of the model. The barplot 
shows mean and standard deviation of FoF in bins of width 0.05. The dashed line shows piecewise 
regression results, showing the proportional effect of positive weights on the FoF, versus the 
negative constant baseline of negative weights. 

The relation shown in Figure [5] suggests that the Friend-or-Foe coefficient is a good estimator for 
positive cultural affinities, if it is above certain value. Below a threshold, the FoF can be used as 
a way to detect negative affinities, but it does not allow us to estimate their magnitude. To apply 
these results to empirical data from Eurovision we need to find the value Q neg below which we 
can conclude that a FoF represents a negative relation, and the value pO s over which we can 
estimate a positive positive relation through the FoF. For this decision problem, we define the 
following classification rule: 
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FoF(e id ) > Gp OS 


then 


w i:j 


> 


else if 


FoF(eij) < Q neg 


then 


Wi,j 


< 






else 


w id 


= 



(4) 

Each possible set of values for Q pos and Q neg defines two discriminants that map the FoF coef- 
ficients to the sign of the affinity between pairs of countries Wij. Given this prediction, we can 
classify each pair given its FoF, having certain amounts of correctly and incorrectly classified 
relations. This way, for a given value of Q p0 s, we compute the values of precision and recall |33j 
over our simulation as 

tp tp 

Precision = — Recall = — (5) 

tp + jp tp + fn 

where tp and fp are the amounts of true and false positives, i.e. pairs with FoF(eij) > Q pos 
that had Wij > for tp, and Wij < for fp. The amount of false negatives fn is the count of 
pairs with FoF(eij) < Q pos that had Wij > 0. In an equivalent manner, we compute precision 
and recall for Q neg , inverting the rules of Equation [5] To independently assess the quality of the 
discriminant given a value for each threshold, we compute the F\ scores combining precision and 
recall in a unidimensional metric 1 33 1 



p _ 2 Precision pos ■ Recall pos ^ _ ^ Precision neg ■ Recall neg 
Precision pos + Recall pos neg Precision neg + Recall neg 

Figure [5] shows the value of F pos and F neg as a function of the value of the thresholds @ pos arid 
Qneg- Both classification rules have maximum Fi values for Q pos = —0.2 and Q neg = —0.24. 
These F\ are above 0.8, and G pos > Q n eg, providing us with a valid and consistent method to 
determine the sign of a cultural relation given its FoF in a contest. The dashed horizontal lines of 
Figure|4]are placed at —0.2 and —0.24, to guide us in deciding negative affinities between couples 
of countries. Using the inverse form of Equation [3j we can formulate the following method to 
estimate the affinity of one country for another from the FoF in a given contest: 



if FoF(eij) > Q pos then w i:j = FoF ^ e ^ — h. 

bo 

else if FoF(eij) < Q neg then Wij < 

else Wij = (7) 

where the values of bo, b\, Q pos , and Q neg have been estimated from computer simulations as 
explained above. 
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Figure 6: F values for negative and positive weight classifications as a function of the chosen 
threshold. Vertical lines indicate the maximum points for both, at Q pos = —0.2 and Q neg = —0.24 



4 Analysis of Cultural Affinity in Eurovision 
4.1 The network of affinity among European countries 

The analysis of simulated biased contests allowed us to define the estimator of Equation [7J which 
lets us detect negative affinities between countries, and estimate values of positive affinities. In 
this section, we will apply this rule to the FoF coefficients across several contests, analyzing the 
properties of the network of cultural affinity between European countries. 

For each pair of countries c c and c v , we have the FoF coefficients in the contests where they 
appeared FoF t (e CtV ), where t denotes a year where both countries participated. We apply Equa- 
tion [7] to convert this time series of FoFs into a time series of affinity estimations w*. v . First, 
we summarize the history of cultural affinities of c c towards c v with two metrics, the average 
positive affinity 
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and the ratio of negative affinities 

N c , v = ^(l-e[wl v }) (9) 
t 

where T is the length of the studied time period, and Q[w] is the Heaviside step function, with 
value 1 if w > and otherwise. These two metrics allow us to aggregate the historical relations 
between European countries, as estimated from the Eurovision results in a certain time period. 
Each metric defines a network between countries, one with average positive affinities as weights, 
and another one with frequencies of negative affinites as weights. We show these two networks 
for the period 2007-2012 in Figure [7j with edge width and darkness according to edge weight 
{P c ,v an d N c<v respectively), and node darkness according to node strength, i.e. the sum of the 
weights of incident edges. 

The left panel of Figure [7] shows the network of positive relations displayed with the weighted 
ARF algorithm of the Cuttlefish Network Workbench ^ which highlights node clusters. This 
confirms previously found voting coalitions |12| . I19 | IM]. including a Balkan cluster of former Yu- 
goslavian republics, a Scandinavian cluster, and the pairs Cyprus-Greece and Moldova-Romania. 
In addition, Turkey has a significantly high strength, which confirms previous conjectures of the 
effect of Turkish diaspora |38]. Among all the countries studied in this period, the strongest 
one is Belgium, which to our knowledge was never found to play a special role in this contest. 
Its possible mediator role in the European Union, combined with immigration from all across 
Europe might be the reason that drives Belgium to this privileged position. 

The right panel of Figure [7] shows the network with negative affinity ratios as edge weights, with 
node darkness proportional to strength calculated as the sum of weights of incident edges. Using 
the same layout algorithm as for its positive counterpart, there is no clear community structure 
in this negative network. A possible explanation for this lack of clustering is the balanced nature 
of signed networks |10j . which naturally avoids negative triangles. The strongest nodes in this 
network are Greece, Azerbaijan, Turkey, and Armenia, all involved in a series of territorial and 
political conflicts with each other. While we quantitatively test for the reasons for the structure of 
these networks, the FoF coefficient allows us to quantify the cultural affinities between European 
countries. 

4.2 Time evolution of cultural affinities 

While the networks of Figure [7] show the time aggregates of cultural relations among European 
countries, we want to measure how these networks evolve in time. To do so, we compute the time 



www. cuttlefish. sourcef orge .net 
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Figure 7: Left: network of positive FoF averaged between the years 2007 to 2012. Edge color, 
weight, and width are proportional to the average FoF from one country to another. Node 
darkness corresponds to the total sum of incoming edge weights. Right: Network of counts of 
negative FoF for the same time period as in the left. Edge width, weight, and color is proportional 
to the amount of negative instances of FoF from one country to another, and node darkness is 
proportional to the sum of incoming edge weights. 



series of network asymmetry, defined as 

asymmetry(t) = — 



w 



c.v 



(10) 



where E is the total amount of edges, i.e. pairs of voter-competing countries in the network. The 
rationale behind this metric is to estimate how strong are the typical asymmetries illustrated in 



Section 3.3 by computing the average distance between reciprocal pairs of estimated affinities. 
In addition, we also calculate the time series of network polarization as 



polarization(t) 



which is essentially the standard deviation of estimated affinities across all the edges in the 
network. This polarization metric takes higher values when cultural and political biases are 
strong, in comparison to the artistic component of the contest. If all countries agreed on the best 
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songs in the same manner, the polarization would have a value close to zero, as the resulting 
FoFs would lead to moderate values of w as calculated in Equation [7] 
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Figure 8: Time series of asymmetry and polarization in the Eurovision editions since 1975. 
Asymmetry shows a negative trend in the last two decades, but keeping always above 0.2. 



As mentioned in Section 2.1, the organization of the Eurovision song contest has been accused 
of having an increasing level of unfairness and lack of artistic content. If that is the case, voting 
clusters should decrease asymmetry, and political and popularity effects should create a hetero- 
geneity in the votes that increases polarization. To test those hypotheses, we created the time 
series of asymmetry and polarization for each contest edition, shown in Figure [8] These were 
calculated on the affinity estimations over the each year from 1975 to 2012. From 1990, there is 
a negative trend in asymmetry that leads to more symmetrical contest outcomes towards 2012, 
while polarization does not seem to follow this pattern. This decreasing asymmetry indicates 
that countries tend to vote each other as the other one voted them before. This increasing reci- 
procity can be built up along many interactions, where the already known cultural clusters form 
and strengthen, due to voting dependencies as well as the changes on the political landscape 
of Europe. On average, the overall asymmetry of the contest has a value of 0.27, giving us a 
quantization of the cultural reciprocity between the countries of Europe. 

Figure [8] also shows a lack of a significant trend in polarization, meaning that, at this high level 
of aggregation, we cannot detect any pattern of political influence in the contest outcome. This 
does not rule out the possibility of increasing polarization for groups of countries, or around 
individual participants, which we discuss in the following section. 
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5 Measuring International Relations in the EU 

One of the main motivations for studying Eurovision data is its representativeness of the whole 
Europe, giving the possibility to study the relations among countries. It is particularly relevant 
to study if the events of the financial crisis across members of the European Union can be traced 
through our statistical measures, looking for a macroscope of the state of the union in the EU. 
In this section, we focus on two subsets of countries of the EU: the 12 founding members of the 
Eurozone, and the EU-15, which is the set of members of the European Union since 1995. For 
each subset we compute the value 

1 

|G| 



P G {t) 



12) 



{c,v)dG 

which is the average positive affinities between the studied countries G. 

Figure [9] shows the time series of P for the countries in the Eurozone, in the EU-15, and in the 
whole European Broadcasting Union. In 2010 and 2011, P shows a significantly higher value for 
the Eurozone and EU-15 countries than in previous years, where it was not easily distinguished 
from the rest of the contest. To explore this comparison between a subset of countries and the 
whole Europe, we computed the Z-score of P as zo(t) = Pg ^~ p ^ where Pit) is the mean and 
a pos {t) is the standard deviation of positive weights for all Europe in year t. Figure 10 shows the 



time evolution of z(t), for both the Eurozone and the EU-15. We can appreciate the fluctuation 
above zero in 2001 and 2011, as well as a fluctuation below zero in 2007. 
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Figure 9: Time series of P from 1998 to 2012, for countries in the Eurozone, in EU-15, and all 
countries. 

These Z-scores keep below 0.3, showing an anecdotal fluctuation rather than a consistent result. 
Nevertheless, they indicate the presence of a pattern that could be measured with topological 
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Figure 10: Time series of the Z-score of P per year since 1998, comparing the distribution of P 
in the whole contest versus the value of P for countries in the Eurozone and in EU-15. 



metrics that also include negative weights, such as the asymmetry and polarization of Equa- 
tion 10 and Equation 11 respectively. Focusing on the EU-15, we calculated asymmetry (t) and 



polarization(t) for the subnetwork formed by the 15 countries of EU-15, which are shown from 
1998 to 2012 in Figure 11 These two metrics show a strong fluctuation in 2010 and 2011, the 
first two years of the debt and austerity measures across the studied countries. The increase 
in asymmetry reflects the lower cohesion among countries in the EU-15, which voted outside 
their usual geographical clusters. This effect is also present when analyzing the Eurozone, clearly 
influencing countries related to the common currency. The increase in polarization shows that 
their votes where less homogeneous and their biases led to strong FoFs, reflecting an influence 
exogenous to the artistic component of the contest. 



Figure 12 shows the EU-15 subnetworks for the contest editions between 2009 and 2012. Blue 
edges represent positive affinities with a width and darkness proportional to the estimated affinity 
of a country towards the other. Red edges represent negative affinities of a constant width as we 
cannot estimate their magnitude. These visualizations allow us to conjecture about the origin 
of the changes in asymmetry and polarization after the crisis started, in particular in 2011. The 
density of the network is much higher in 2011, with a large amount of negative affinities, but 
also with stronger positive ones. The ARF layout algorithm highlights a network division which 
includes countries in debt on one side (Spain, Portugal, Italy, Greece), and countries not yet in 
crises on the other (Netherlands, Germany, Sweden, Finland), with the exception of Ireland in 
this second cluster, and the UK as a bridge between both. 

While we cannot precisely quantify the dependence of the debt crisis and the metrics of asymme- 
try and polarization, our observations indicate a significant change in 2010 and 2011, coinciding 
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Figure 11: Time series of asymmetry and polarization among the countries in EU-15, between 
1998 and 2012. A strong peak is present in 2010 and 2011, coming back to normal values in 2012. 

with the loans and austerity measures in Portugal, Ireland, Italy, Greece, and Spain. In addition, 
these two metrics where quite stable since 1998, the year when televoting was introduced in 
Eurovision, only having such change in 2010 and 2011. 

6 Conclusions 

We have studied the cultural relations among European countries through the behavioral biases 
present in the Eurovision song contest. To do so, we gather a dataset of the historical contest 
outcomes, which aggregate the votes of large amounts of viewers who simultaneously vote by 
sending mobile phone messages. Our approach is centered around the statistical analysis this 
large-scale dataset, producing metrics that compose a macroscope of the cultural cohesion in 
Europe at large. 

The first metric defined here is the Friend-or-Foe coefficient, a metric that reveals asymmetric 
positive and negative relations between European countries. We validated how this metric esti- 
mates underlying cultural affinities against a model of biased Eurovision contests. This allowed 
us to find a statistical rule to detect negative affinities, and a way to estimate the value of positive 
affinities given the outcome of a contest. Applying such rule, we could extract the networks of 
positive and negative relations across countries, which reveal known clusters and dyadic rela- 
tions. Over this network data, we designed metrics of asymmetry and polarization, which detect 
changes in the voting patterns among participating countries. Applying these metrics to the 
votes between countries in the EU-15, we find a significant change in 2010 and 2011, the most 
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Figure 12: Affinity networks between the EU-15 countries for the 2009, 2010, 2011, and 2012 
editions of Eurovision. Each node represents a country and each directed edge connects two 
countries if their affinity was found different than 0. Blue edges represent positive affinities with 
width and darkness proportional to the estimated affinity. Red edges denote instances of negative 
affinities. All the networks where plotted with the weighted ARF layout algorithm, as explained 
above. 

turbulent years with respect to debt and austerity measures in the EU. These suggest, in turn, 
that the political climate on the EU can influence the perception of culture across countries, and 
economic decisions of the involved states can change the way societies relate to each other in the 
EU. 

Additional datasets are available to extend our results. Wikipedia also contains the results of 
the semifinals of the contest, which can be used to refine the Friend-or-Foe coefficient as an 
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aggregation of all the available data. Furthermore, our structural analysis can be combined with 
statistics of online behavior, such as measures of search queries, website visits, or amounts of 
views and comments for the Youtube videos of participants. These extended datasets, in turn, 
could be used to create testable predictors for the outcome of Eurovision contests. 

A clear limitation of our analysis is the country level restriction given the data provided by 
the Eurovision song contest. Cultures do not need to map to countries, as ethnic minorities or 
pan-state cultures are neglected in this analysis. This limitation is present in the current state- 
of-the-art studies [T], waiting for sources of cultural data at different levels of aggregation. Our 
estimation of cultural affinity through the Friend-or-Foe coefficient depends on the way these 
two relate to each other, which we explored through model simulations. In these simulations, we 
have chosen a uniform distribution of the affinities between -1 and 1, on an unbiased random 
network. For the purpose of completeness, future work should consider other distributions (for 
example, normal distribution centered around with some deviation) and other mechanisms of 
link formation (e.g. reciprocity and clustering). The nature of this network, as well as the rules 
to decide votes according to song quality, are empirically testable, and can be validated if more 
precise data becomes available. 

The modeling and analysis approach presented here can be applied to other contests, for example 
in artist popularity competitions [7], or beauty contests. The Friend-or-Foe coefficient can be 
adapted to other contest schemes, looking for alternative support on the way cultural affinity 
creates voting biases. Finally, the metrics of positivity, asymmetry, and polarization compose a 
"European mood" metric, which we have shown to be related to the economical and political 
decisions of the European Union, and its member states. Applying this metric to future contests, 
we can investigate how political decisions influence the state of cultural cohesion between the 
inhabitants of European countries. 
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